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Adenosine kinase homologous proteins involved in tiie regulation of 
energy homeostasis 

^ Description 

This invention relates to the use of nucleic acid sequences encoding 
adenosine kinase {referred to as ADK) and homologous proteins, and the 
polypeptides encoded thereby and to the use thereof or effectors of ADK 

10 in the diagnosis, study, prevention, and treatment of diseases and 
disorders related to body-weight regulation, for example, but not limited to, 
metabolic diseases or dysfunctions such as obesity as well as related 
disorders such as eating disorder, cachexia, diabetes mellitus, 
hypertension, coronary heart disease, hypercholesterolemia, dyslipidemla, 

15 osteoarthritis, gallstones, cancer, e.g. cancers of the reproductive organs, 
and sleep apnea. 



There are several metabolic diseases of human and animal metabolism, eg., 
obesity and severe weight loss, that relate to energy imbalance where 
20 caloric intalce versus energy expenditure is Imbalanced. Obesity is one of 
the most prevalent metabolic disorders in the world. It is still a poorly 
understood human disease that becomes as a major health problem more 
and more relevant for western society. Obesity is defined as a body weight 
more than 20% In excess of the ideal body weight, frequently resulting In 
25 a significant impairment of health. It is associated with an increased risk 
for cardiovascular disease, hypertension, diabetes, hyperlipidaemla and an 
increased mortality rate. Besides severe risks of illness, individuals 
suffering from obesity are often Isolated socially. 



30 



Obesity is influenced by genetic, metabolic, biochemical, psychological, 
and behavioral factors. As such, it is a complex disorder that must be 
addressed on several fronts to achieve lasting positive clinical outcome. 



Since obesity is not to be considered as a single disorder but as a 
fieterogeneous group of conditions with (potential) multiple causes, it is 
also characterized by elevated fasting plasma insulin and an exaggerated 
insulin response to oral glucose intake (Koltermann, J. Clin. Invest 65, 
1980, 1272-1284). A clear involvement of obesity in type 2 diabetes 
mellitus can be confirmed (Kopelman, Nature 404, 2000, 635-643). 

The molecular factors regulating food intake and body weight balance are 
incompletely understood. Even if several candidate genes have been 
described which are supposed to influence the homeostatic system(s) that 
regulate body mass/weight, like leptin, VCPI, VCPL or the peroxisome 
proliferator-activated receptor-gamma co-activator, the distinct molecular 
mechanisms and/or molecules influencing obesity or body weight/body 
mass regulations are not known. In addition, several single-gene mutations 
resulting in obesity have been described in mice, implicating genetic factors 
in the etiology of obesity. (Friedman and Leibel, 1990, Cell 69: 217-220). 
In the ob mouse a single gene mutation (obese) results in profound obesity, 
which Is accompanied by diabetes (Friedman et. al., 1991, Genomics 11: 
1054-1062 ). 



Therefore, the technical problem underlying the present Invention was to 
provide for means and methods for modulating (pathological) metabolic 
conditions influencing body-weight regulation and/or energy homeostatic 
circuits. The solution to said technical problem is achieved by providing the 
embodiments characterized In the claims. 

Accordingly, the present invention relates to genes with novel functions in 
body-weight regulation, energy homeostasis, metabolism, and obesity. The 
present invention discloses specific genes involved in the regulation of 
body-weight, energy homeostasis, metabolism, and obesity, and thus In 
disorders related thereto such as eating disorder, cachexia, diabetes 
mellitus, hypertension, coronary heart disease, hypercholesterolemia. 



dysltpidemla, osteoarthritis, gallstones, cancer, e.g. cancers of the 
reproductive organs, and sleep apnea. In particular, the present Invention 
describes the human adenosine kinase homologous genes as being 
involved in those conditions mentioned above. 

The term 'GenBank Accession number' relates to NCBI GenBank database 
entries (Benson et al, Nucleic Acids Res. 28, 2000, 15-18). 

Adenosine kinase (ADK) is an abundant enzyme in mammalian tissues that 
catalyzes the transfer of the -gamma-phosphate from ATP to adenosine, 
thereby serving as a regulator of concentrations of both extracellular 
adenosine and Intracellular adenine nucleotides. Adenosine is a hormone 
that regulates blood pressure and flow e.g. in the nephron and other 
tissues. It causes constriction of blood vessels thereby increasing blood 
pressure similar to the hormone vasopressin (ADH). Adenosine has 
widespread effects on the cardiovascular, nervous, respiratory, and 
immune systems and inhibitors of ADK could play an Important 
pharmacological role in increasing intravascular adenosine concentrations 
and acting as antiinflammatory agents. 

ADK regulates actively the hormone action through phosphorylation. The 
activity of ADK is significantly Impaired in tissues of diabetic rat. Changes 
in the activity of ADK were accompanied by alterations in Its mRNA and 
protein level. These changes depended on insulin level and were not 
related to glucose level. During the first 7 h after insulin treatment the level 
of ADK mRNA, protein and enzymatic activity in kidneys, liver and heart 
returned to normal values. The observed relation between Insulin and ADK 
expression level may suggest that Insulin increases the rate of ADK gene 
transcription. Decreased activity of ADK was associated with elevated level 
of adenosine in diabetic tissues. On the 10th day after the streptozotocin 
treatment there was a 3.5 and 2-fold increase in adenosine content in heart 
and liver, respectively. On the other hand, in diabetic kidney adenosine 
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level was elevated only by 20%. Administration of insulin to diabetic rats 
^ resulted in a drop of adenosine to the level seen in normal heart and liver 
whereas, in kidneys the adenosine content was 50% lower than that 
observed under normal conditions. The time-dependent coarse of changes 
5 in adenosine level was different from that observed for ADK activity, what 
may suggest that other factors, possibly nucleoside transporters are also 
important for controlling adenosine level in the cell (Sakowicz M. and 
Pawelczyk T., (2002) Mol Cell Biochem. 236{1-2):1 63-171), The potent 
and selective ADK inhibitor, 5'-deoxy-5-iodotubercidin, is equally effective 
10 as morphine in blocking tactile allodynia in the streptozotocin-induced rat 
model of diabetes {Lynch J. J, 3rd et ah, (1999) Eur J Pharmacol. 
364(2-3): 141-1 46-). ADK activity was unchanged in white and brown fat 
of male diabetic rats or hypothyroidism, but increased with age (Jamal Z. 
and Saggerson E. D., (1987) Biochem J 245(3):881-886). Changes in 
15 adenosine concentration may be important in changing insulin sensitivity in 
adipose tissue whereas changes in adenosine receptor number may be 
more important in muscle (Newsholme E. A. et ah, (1985) FEES Lett 
181(1):189"192). However, although a change in activity of ADK by 
insulin was described in the prior art, it has not been described in the prior 
20 art that G3PDH is directly involved in the regulation of energy homoestasis 
and thus involved in the storage of triglycerides. 

So far, it has not been described that the ADK proteins of the invention 
and homologous proteins are involved in the regulation of energy 
25 homeostasis and body-weight regulation and related disorders, and thus, 
no functions in metabolic diseases and dysfunctions and other diseases as 
listed above have been discussed. 

In this invention, we demonstrate that the correct gene dose of ADK is 
30 essential for maintenance of energy homeostasis. The fly Drosophila 
melanogaster was used as model organism for the identification of proteins 
involved in the energy homeostasis. Drosophila melanogaster is one of the 
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most Intensively studied organisms in biology and serves as a model 
system for the investigation of many developmental and cellular processes 
common to higher eukaryotes, including humans (see, for example, Adams 
et al., (2000) Science 287: 2185-2195). The success of Drosophila 
5 melanogaster as a model organism is largely due to the power of forward 
genetic screens to identify the genes that are involved in a biological 
process (see, Johnston Nat Rev Genet 3: 176-188 (2002); Rorth, (1996) 
Proc Natl Acad Sci U S A 93: 12418-12422). In this invention, we have 
used a genetic screen to identify, that mutations of ADK homologous 
10 genes cause changes in the body weight which is reflected by a significant 
change in the triglyceride levels. Triglycerides are the most efficient 
storage for energy in cells, and are significantly increased in obese 
patients. 

15 In this invention we refer to the proteins encoded by Ady43A, adenosine 
kinase (ADK), or ADK-like proteins, which includes Drosophila and 
mammalian, preferably human, homologous polypeptides or proteins or 
sequences encoding those proteins as proteins of the invention. 

20 The present invention discloses that adenosine kinase homologous proteins 
are regulating the energy homeostasis and fat metabolism especially the 
metabolism and storage of triglycerides, and polynucleotides, which 
identify and encode the proteins disclosed in this invention. The invention 
also relates to vectors, host cells, antibodies, and recombinant methods for 

25 producing the polypeptides and polynucleotides of the invention. The 
invention also relates to the use of these polynucleotides, polypeptides and 
effectors thereof in the diagnosis, study, prevention, and treatment of 
diseases and disorders, for example, but not limited to, metabolic diseases 
such as obesity as well as related disorders such as eating disorder, 
30 cachexia, diabetes mellitus, hypertension, coronary heart disease, 
hypercholesterolemia, dyslipidemia, osteoarthritis, gallstones, cancer, e.g. 
cancers of the reproductive organs, and sleep apnea. 



Adenosine kinase homologous proteins and nucleic acid molecules coding 
therefore are obtainable from insect or vertebrate species, e.g. mammals or 
birds. Particularly preferred are nucleic acids encoding the human 
adenosine kinase homologs (in particular the human adenosin kinase 
isoforms). 

The invention particularly relates to a nucleic acid molecule encoding a 
polypeptide contributing to regulating the energy homeostasis and the 
metabolism of triglycerides, wherein said nucleic acid molecule comprises 

(a) the nucleotide sequence of DrosophHa Ady43A, human adenosine 
kinase homologs, and/or a sequence complementary thereto, 

(b) a nucleotide sequence which hybridizes at BO^C in a solution 
containing 1 x SSC and 0.1 % SDS to a sequence of (a), 

(c) a sequence corresponding to the sequences of (a) or (b) within the 
degeneration of the genetic code, 

(d) a sequence which encodes a polypeptide which is at least 85%, 
preferably at least 90%, more preferably at least 95%, more 
preferably at least 98% and up to 99,6% identical to the amino acid 
sequences of the adenosine kinase protein, preferably of the human 
adenosine kinase homologs, 

(e) a sequence which differs from the nucleic acid mofecule of (a) to (d) 
by mutation and wherein said mutation causes an alteration, 
deletion, duplication and/or premature stop in the encoded 
polypeptide or 

(f) a partial sequence of any of the nucleotide sequences of (a) to (e) 
having a length of at least 15 bases, preferably at least 20 bases, 
more preferably at least 25 bases and most preferably at least 50 
bases. 

The present invention relates to genes with novel functions in body-weight 
regulation, energy homeostasis, metabolism, and obesity, fragments of 
said genes, polypeptides encoded by said genes or fragments thereof, and 



effectors e.g. antibodies, biologically active nucleic acids, such as 
antisense molecules, RNAi molecules or ribozymes, aptamers, peptides or 
low-molecular weight organic compounds recognizing said polynucleotides 
or polypeptides. 

The invention is based on the finding that adenosine kinase and the 
polynucleotides encoding these, are involved in the regulation of 
triglyceride storage and therefore energy homeostasis. To find genes with 
novel functions in energy homeostasis, metabolism, and obesity, a 
functional genetic screen was performed with the model organism 
Drosophila melanogaster (Meigen). One resource for screening was a 
Drosophila melanogaster stock collection of EP-lines. The P-vector of this 
collection has Gal4-UAS-binding sites fused to a basal promoter that can 
transcribe adjacent genomic Drosophila sequences upon binding of Gal4 to 
UAS-sites (Brand A. H. and Perrimon N., (1993) Development 
118:401-415; Rorth P., (1996) Proc Natl Acad Sci U S A 
93:1241 8-12422). This enables the EP-line collection for overexpression of 
endogenous flanking gene sequences. In addition, without activation of the 
UAS-sites, integration of the EP-element into the gene is likely to cause a 
reduction of gene activity, and allows determining its function by 
evaluating the loss-of-function phenotype. 

Triglycerides are the most efficient storage for energy in cells, and obese 
patients mainly show a significant increase in the content of triglycerides. 
In order to isolate genes with a function in energy homeostasis, several 
thousand EP-lines were tested for their triglyceride content after a 
prolonged feeding period (see Examples for more detail). Lines with 
significantly changed triglyceride content were selected as positive 
candidates for further analysis. The change of triglyceride content due to 
the loss of a gene function suggests gene activities in energy homeostasis 
in a dose dependent manner that controls the amount of energy stored as 
triglycerides. 



In this Invention, the content of triglycerides of a pool of flies with the 
same genotype after feeding for six days was analyzed using a triglyceride 
assay. Male flies homozygous for the Integration of vectors for Drosophila 
line HD-EP(2)21 939 were analyzed in assays measuring the triglyceride and 
glycogen contents of these flies, illustrated in more detail in the 
EXAMPLES section. The results of the triglyceride and glycogen content 
analysis are shown in FIGURE 1 . 

Genomic DNA sequences were isolated that are localized to the EP vector 
(herein HD-EP(2)21939) integration. Using those isolated genomic 
sequences public databases like Berkeley Drosophila Genome Project 
{GadFly; see also FlyBase (1999) Nucleic Acids Research 27:85-88) were 
screened thereby identifying the integration site of the vectors, and the 
corresponding gene, described in more detail in the EXAMPLES section. 
The molecular organization of the gene is shown In FIGURE 2. 

The present invention further describes polypeptides comprising the amino 
acid sequences of the proteins of the invention and homologous proteins. 
Based upon homology, the proteins of the invention and each homologous 
protein or peptide may share at least some activity. 

The invention also encompasses polynucleotides that encode the proteins 
of the invention and homologous proteins. Accordingly, any nucleic acid 
sequence, which encodes the amino acid sequences of the proteins of the 
invention and homologous proteins, can be used to generate recombinant 
molecules that express the proteins of the invention and homologous 
proteins. In a particular embodiment, the invention encompasses a nucleic 
acid encoding Drosophila Ady43A, or human adenosine kinase homoiogs; 
referred to herein as the proteins of the invention. It will be appreciated by 
those skilled in the art that as a result of the degeneracy of the genetic 
code, a multitude of nucleotide sequences encoding the proteins, some 
bearing minimal homology to the nucleotide sequences of any known and 



naturally occurring gene, may be produced. The invention contemplates 
each and every possible variation of nucleotide sequence that can be made 
by selecting combinations based on possible codon choices. 

Also encompassed by the invention are polynucleotide sequences that are 
capable of hybridizing to the claimed nucleotide sequences, and in 
particular, those of the polynucleotide encoding the proteins of the 
invention, under various conditions of stringency. Hybridization conditions 
are based on the melting temperature (Tm) of the nucleic acid binding 
complex or probe, as taught In Wahl, G, M. and S. L. Berger {1987: 
Methods Enzymol. 152:399-407) and Kimmel, A. R. (1987; Methods 
EnzymoK 152:507-511), and may be used at a defined stringency. 
Preferably, hybridization under stringent conditions means that after 
washing for 1 h with 1 x SSC and 0.1 % SDS at 50*^C, preferably at 55^C, 
more preferably at 62^C and most preferably at 68 °C, particularly for 1 h 
in 0.2 X SSC and 0.1 % SDS at 50^C, preferably at 55°C, more preferably 
at 62^C and most preferably at 68°C, a positive hybridization signal is 
observed. Altered nucleic acid sequences encoding the proteins which are 
encompassed by the invention include deletions, insertions or substitutions 
of different nucleotides resulting in a polynucleotide that encodes the same 
or a functionally equivalent protein. 

The encoded proteins may also contain deletions, insertions or 
substitutions of amino acid residues, which produce a silent change and 
result in functionally equivalent proteins. Deliberate amino acid 
substitutions may be made on the basis of similarity in polarity, charge, 
solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of 
the residues as long as the biological activity of the protein is retained. 
Furthermore, the invention relates to peptide fragments of the proteins or 
derivatives thereof such as cyclic peptides, retro-inverso peptides or 
peptide mimetics having a length of at least 4, preferably at least 6 and up 
to 50 amino acids. 
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Also included within the scope of the present invention are alleles of the 
genes encoding the proteins of the invention and homologous proteins. As 
used herein, an 'allele' or 'allelic sequence' is an alternative form of the 
gene, which may result from at least one mutation in the nucleic acid 
sequence. Alleles may result in altered mRNAs or polypeptides whose 
structures or function may or may not be altered. Any given gene may 
have none, one or many allelic forms. Common mutational changes, which 
give rise to alleles, are generally ascribed to natural deletions, additions or 
substitutions of nucleotides. Each of these types of changes may occur 
alone or in combination witl> the others, one or more times in a given 
sequence. 

The nucleic acid sequences encoding the proteins of the invention and 
homologous proteins may be extended utilizing a partial nucleotide 
sequence and employing various methods known in the art to detect 
upstream sequences such as promoters and regulatory elements. 

In order to express a biologically active protein, the nucleotide sequences 
encoding the proteins or functional equivalents, may be inserted into 
appropriate expression vectors, i.e., a vector which contains the necessary 
elements for the transcription and translation of the inserted coding 
sequence. Methods, which are well known to those skilled In the art, may 
be used to construct expression vectors containing sequences encoding 
the proteins and the appropriate transcriptional and translational control 
elements. Regulatory elements include for example a promoter, an initiation 
codon, a stop codon, a mRNA stability regulatory element, and a 
polyadenylation signal. Expression of a polynucleotide can be assured by (i) 
constitutive promoters such as the Cytomegalovirus (CMV) 
promoter/enhancer region, (ii) tissue specific promoters such as the insulin 
promoter {see, Soria et al., 2000, Diabetes 49:157), SOX2 gene promoter 
{see Li et al., 1998, Curr. Biol. 8:971-4), Msi-1 promoter (see Sakakibara 
et al., 1997, J. Neuroscience 17:8300-8312), alpha-cardia myosin heavy 
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chain promotor or human atrial natriuretic factor promoter (Klug et ah, 
1996, J. Clin. Invest 98:216-24; Wu et a!., 1989, J. Biol. Chem. 
264:6472-79)or (iii) inducible promoters such as the tetracycline inducible 
system. Expression vectors can also contain a selection agent or marker 
gene that confers antibiotic resistance such as the neomycin, hygromycin 
or puromycin resistance genes. These methods include in vitro recombinant 
DNA techniques, synthetic techniques, and in vivo genetic recombination. 
Such techniques are described in Sambrook, J. et al. (1989) Molecular 
Cloning, A Laboratory Manual, Cold Spring Harbor Press, Plamview, N.Y. 
and Ausubel, P.M. et al. (1989) Current Protocols in Molecular Biology, 
John Wiley & Sons, New York, N.Y. 

In a further embodiment of the invention, natural, modified or recombinant 
nucleic acid sequences encoding the proteins of the invention and 
homologous proteins may be ligated to a heterologous sequence to encode 
a fusion protein. 

A variety of expression vector/host systems may be utilized to contain and 
express sequences encoding the proteins or fusion proteins. These include, 
but are not limited to, micro-organisms such as bacteria transformed with 
recombinant bacteriophage, plasmid or cosmid DNA expression vectors; 
yeast transformed with yeast expression vectors; insect cell systems 
infected with virus expression vectors (e.g., baculovirus, adenovirus, 
adeno-associated virus, lentiverus, retrovirus); plant cell systems 
transformed with virus expression vectors (e.g., cauliflower mosaic virus, 
CaMV; tobacco mosaic virus, TMV) or with bacterial expression vectors 
(e.g., Ti or PBR322 plasmids); or animal cell systems. 

The presence of polynucleotide sequences of the invention in a sample can 
be detected by DNA-DNA or DNA-RNA hybridization and/or amplification 
using probes or portions or fragments of said polynucleotides. Nucleic acid 
amplification based assays involve the use of oligonucleotides or oligomers 
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based on the sequences specific for the gene to detect transformants 
containing DNA or RNA encoding the corresponding protein. As used 
herein 'oligonucleotides' or 'oligomers' refer to a nucleic acid sequence of 
at least about 10 nucleotides and as many as about 60 nucleotides, 
preferably about 1 5 to 30 nucleotides, and more preferably about 20-25 
nucleotides, which can be used as a probe or amplimer. 

A wide variety of labels and conjugation techniques are known by those 
skilled in the art and may be used in various nucleic acid and amino acid 
assays. Means for producing labeled hybridization or PGR probes for 
detecting polynucleotide sequences include oligo-labeling, nick translation, 
end-labeling of labeled RNA probes, PGR amplification using a labeled 
nucleotide, or enzymatic synthesis. These procedures may be conducted 
using a variety of commercially available kits (Pharmacia & Upjohn, 
(Kalamazoo, Mich.); Promega (Madison Wis.); and U.S. Biochemical Corp., 
(Cleveland, Ohio). 

The presence of proteins of the invention in a sample can be determined by 
Immunological methods or activity measurement. A variety of protocols for 
detecting and measuring the expression of proteins, using either polyclonal 
or monoclonal antibodies specific for the protein or reagents for 
determining protein activity are known in the art. Examples include 
enzyme-linked immunosorbent assay (ELISA), radioimmunoassay (RIA), and 
fluorescence activated cell sorting (FAGS). A two-site, monoclonal-based 
immunoassay utilizing monoclonal antibodies reactive to two 
non-interfering epitopes on the protein is preferred, but a competitive 
binding assay may be employed. These and other assays are described, 
among other places, in Hampton, R. et al. (1990; Serological Methods, a 
Laboratory Manual, APS Press, St Paul, Minn.) and Maddox, D. E. et al. 
(1983; J. Exp. Med. 158:1211-1216). 
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Suitable reporter molecules or labels, which may be used, include 
radionuclides, enzymes, fluorescent, chemiluminescent or chromogenic 
agents as well as substrates, co-factors, inhibitors, magnetic particles, and 
the like. 

The nucleic acids encoding the proteins of the invention can be used to 
generate transgenic animal or site specific gene modifications in cell lines. 
Transgenic animals may be made through homologous recombination, 
where the normal locus of the genes encoding the proteins of the invention 
is altered. Alternatively, a nucleic acid construct is randomly integrated into 
the genome- Vectors for stable integration include plasmids, retrovirusses 
and other animal virusses, YACs, and the like. The modified cells or animal 
are useful in the study of the function and regulation of the proteins of the 
invention. For example, a series of small deletions and/or substitutions may 
be made in the genes that encode the proteins of the invention to 
determine the role of particular domains of the protein, functions in 
pancreatic differentiation, etc. 

Specific constructs of interest include anti-sense molecules, which will 
block the expression of the proteins of the invention, or expression of 
dominant negative mutations, A detectable marker, such as for example 
lac-Z, may be introduced in the locus of the genes of the invention, where 
upregulation of expression of the genes of the invention will result in an 
easily detected change'in phenotype. 

One may also provide for expression of the genes of the invention or 
variants thereof in cells or tissues where it is not normally expressed or at 
abnormal times of development. In addition, by providing expression of the 
proteins of the invention in cells In which they are not normally produced, 
one can induce changes in cell behavior. 
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DNA constructs for homologous recombination will comprise at least 
portions of the genes of the invention with the desired genetic 
modification, and will include regions of homology to the target locus, DNA 
constructs for random integration need not include regions of homology to 
mediate recombination. Conveniently, markers for positive and/or negative 
selection are included. Methods for generating cells having targeted gene 
modifications through homologous recombination are known in the art. For 
embryonic stem (ES) cells, an ES cell line may be employed, or embryonic 
cells may be obtained freshly from a host, e,g. mouse, rat, guinea pig etc. 
Such cells are grown on an appropriate fibroblast-feeder layer or grown in 
presence of leukemia inhibiting factor (LIF). 

When ES or embryonic cells or somatic pluripotent stem cells have been 
transformed, they may be used to produce transgenic animals. After 
transformation, the cells are plated onto a feeder layer in an appropriate 
medium. Cells containing the construct may be detected by employing a 
selective medium. After sufficient time for colonies to grow, they are 
picked and analyzed for the occurrence of homologous recombination or 
integration of the construct. Those colonies that are positive may then be 
used for embryo manipulation and blastocyst injection. Blastocysts are 
obtained from 4 to 6 week old superovulated females. The ES cells are 
trypsinized, and the modified cells are injected into the blastocoel of the 
blastocyst. After Injection, the blastocysts are returned to each uterine 
horn of pseudopregnant females. Females are then allowed to go to term 
and the resulting offspring screened for the construct. By providing for a 
different phenotype of the blastocyst and the genetically modified cells, 
chimeric progeny can be readily detected. The chimeric animals are 
screened for the presence of the modified gene and males and females 
having the modification are mated to produce homozygous progeny. If the 
gene alterations cause lethality at some point in development, tissues or 
organs can be maintained as allogenic or congenic grafts or transplants, or 
in vitro culture. The transgenic animals may be any non-human mammal. 
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such as laboratory animal, domestic animals, etc. The transgenic animals 
may be used in functional studies, drug screening, etc. 

Diagnostics and Therapeutics 

The data disclosed in this invention show that the nucleic acids and 
proteins of the invention and effector molecules thereof are useful in 
diagnostic and therapeutic applications implicated, for example but not 
limited to, in metabolic disorders such as obesity as well as related 
disorders such as eating-, disorder, cachexia, diabetes mellitus, 
hypertension, coronary heart disease, hypercholesterolemia, dyslipidemia, 
osteoarthritis, gallstones, cancer, e.g. cancers of the reproductive organs, 
and sleep apnea. Hence, diagnostic and therapeutic uses for the proteins of 
the invention nucleic acids and proteins of the invention are, for example 
but not limited to, the following: (i) protein therapeutic, (ii) small molecule 
drug target, (iii) antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), (iv) diagnostic and/or prognostic marker, (v) 
gene therapy (gene delivery/gene ablation), (vi) research tools, and (vii) 
tissue regeneration in vitro and in vivo (regeneration for all these tissues 
and cell types composing these tissues and cell types derived from these 
tissues). 

The nucleic acids and proteins of the invention and effectors thereof are 
useful in diagnostic afid therapeutic applications implicated in various 
applications as described below. For example, but not limited to, cDNAs 
encoding the proteins of the invention and particularly their human 
homologues may be useful in gene therapy, and the proteins of the 
invention and particularly their human homologues may be useful when 
administered to a subject in need thereof. By way of non-limiting example, 
the compositions of the present invention will have efficacy for treatment 
of patients suffering from, for example, but not limited to, in metabolic 
disorders as described above. 
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The nucleic acids of the invention or fragments thereof, may further be 
useful in diagnostic applications, wherein the presence or amount of the 
nucleic acids or the proteins are to be assessed. Further antibodies that 
bind immunospecifically to the substances of the invention may be used in 
therapeutic or diagnostic methods. 

For example, in one aspect, antibodies, which are specific for the proteins 
of the invention and homologous proteins, may be used directly as an 
effector, e.g. an antagonist or indirectly as a targeting or delivery 
mechanism for bringing a pharmaceutical agent to cells or tissue which 
express the protein. The antibodies may be generated using methods that 
are well known in the art. Such antibodies may include, but are not limited 
to, polyclonal, monoclonal, chimeric single chain. Fab fragments, and 
fragments produced by a Fab expression library. Neutralising antibodies, 
(i.e., those which inhibit dimer formation) are especially preferred for 
therapeutic use. 

For the production of antibodies, various hosts including goats, rabbits, 
rats, mice, humans, and others, may be immunized by injection with the 
protein or any fragment or oligopeptide thereof which has immunogenic 
properties. Depending on the host species, various adjuvants may be used 
to increase immunological response. It is preferred that the peptides, 
fragments or oligopeptides used to induce antibodies to the protein have an 
amino acid sequence consisting of at least five amino acids, and more 
preferably at least 10 amino acids. 

Monoclonal antibodies to the proteins may be prepared using any 
technique that provides for the production of antibody molecules by 
continuous cell lines in culture. These include, but are not limited to, the 
hybridoma technique, the human B-cell hybridoma technique, and the 
EBV-hybridoma technique (Kohler, G. et al. (1975) Nature 256:495-497; 
Kozbor, D. et al. (1985) J. Immunol. Methods 81 :31-42; Cote, R. J. et al. 
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Proc. Natl. Acad. Sci. 80:2026-2030; Cole, S. P. et al. (1984) Mol. Cell 
Biol. 62:109-120)- 

In addition, techniques developed for the production of 'chimeric 
antibodies', the splicing of mouse antibody genes to human antibody genes 
to obtain a molecule with appropriate antigen specificity and biological 
activity can be used (Morrison, S. L. et aL (1984) Proc. Natl. Acad. Sci. 
81:6851-6855; Neuberger, M. S. et al (1984) Nature 312:604-608; 
Takeda, S. et aL (1985) Nature 314:452-454). Alternatively, techniques 
described for the production ,of single chain antibodies may be adapted, 
using methods known in the art, to produce single chain antibodies specific 
for the proteins of the invention and homologous proteins. Antibodies with 
related specificity, but of distinct idiotypic composition, may be generated 
by chain shuffling from random combinatorial immunoglobulin libraries 
(Burton, D. R. (1991) Proc. Natl. Acad. Sci. 88:1 1 120-:3). Antibodies may 
also be produced by inducing in vivo production in the lymphocyte 
population or by screening recombinant immunoglobulin libraries or panels 
of highly specific binding reagents as disclosed in the literature (Orlandi, R. 
etaL (1989) Proc. Natf. Acad. Sci. 86:3833-3837; Winter, G. et aL (1991) 
Nature 349:293-299). 

Antibody fragments which contain specific binding sites for the proteins 
may also be generated. For example, such fragments include, but are not 
limited to, the F(ab')2 fragments which can be produced by Pepsin 
digestion of the antibody molecule and the Fab fragments which can be 
generated by reducing the disulfide bridges of F(ab')2 fragments. 
Alternatively, Fab expression libraries may be constructed to allow rapid 
and easy identification of monoclonal Fab fragments with the desired 
specificity (Huse, W. D. et aL (1989) Science 254:1275-1281). 

Various immunoassays may be used for screening to identify antibodies 
having the desired specificity. Numerous protocols for competitive binding 
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and immunoradiometric assays using either polyclonal or monoclonal 
antibodies with established specificities are well known in the art. Such 
immunoassays typically involve the measurement of complex formation 
between the protein and its specific antibody. A two-site, 
monoclonal-based immunoassay utilizing monoclonal antibodies reacive to 
two non-interfering protein epitopes are preferred, but a competitive 
binding assay may also be employed {Maddox, supra). 

In another embodiment of the invention, the polynucleotides or fragments 
thereof or nucleic acid effector molecules such as antisense molecules, 
aptamers, RNAI molecules or ribozymes may be used for therapeutic 
purposes. In one aspect, aptamers, i.e. nucleic acid molecules, which are 
capable of binding to an AOK protein and modulating its activity, may be 
generated by a screening and selection procedure Involving the use of 
combinatorial nucleic acid libraries. 

In a further aspect, antisense molecules may be used in situations in which 
it would be desirable to block the transcription of the mRNA. In particular, 
cells may be transformed with sequences complementary to 
polynucleotides encoding the proteins of the invention and homologous 
proteins. Thus, antisense molecules may be used to modulate protein 
activity or to achieve regulation of gene function. Such technology is now 
well known In the art, and sense or antisense oligomers or larger 
fragments, can be designed from various locations along the coding or 
control regions of sequences encoding the proteins. Expression vectors 
derived from retroviruses, adenovirus, herpes or vaccinia viruses or from 
various bacterial plasmids may be used for delivery of nucleotide 
sequences to the targeted organ, tissue or cell population. Methods, which 
are well known to those skilled in the art, can be used to construct 
recombinant vectors, which will express antisense molecules 
complementary to the polynucleotides of the genes encoding the proteins 
of the invention and homologous proteins. These techniques are described 
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both in Sambrook et al. (supra) and in Ausubel et al. (supra). Genes 
encoding the proteins of the invention and honnologous proteins can be 
turned off by transforming a cell or tissue with expression vectors, which 
express high levels of polynucleotides that encode the proteins of the 
invention and homologous proteins or fragments thereof. Such constructs 
may be used to introduce untranslatable sense or antisense sequences into 
a cell. Even in the absence of integration into the DNA, such vectors may 
continue to transcribe RNA molecules until they are disabled by 
endogenous nucleases. Transient expression may last for a month or more 
with a non-replicating vector- and even longer if appropriate replication 
elements are part of the vector system. 

As mentioned above, modifications of gene expression can be obtained by 
designing antisense molecules, e.g. DNA, RNA or nucleic acid analogues 
such as PNA, to the control regions of the genes encoding the proteins of 
the invention and homologous proteins, i.e., the promoters, enhancers, and 
introns. Oligonucleotides derived from the transcription initiation site, e.g., 
between positions -1 0 and -h 1 0 from the start site, are preferred. Similarly, 
inhibition can be achieved using "triple helix" base-pairing methodology. 
Triple helix pairing is useful because it cause inhibition of the ability of the 
double helix to open sufficiently for the binding of polymerases, 
transcription factors or regulatory molecules. Recent therapeutic advances 
using triplex DNA have been described In the literature (Gee, J. E. et al. 
(1994) In; Huber, B. E. and B. I. Carr, Molecular and Immunologic 
Approaches, Futura Publishing Co., Mt. Kisco, N.Y.). The antisense 
molecules may also be designed to block translation of mRNA by 
preventing the transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the 
specific cleavage of RNA. The mechanism of ribozyme action involves 
sequence-specific hybridization of the ribozyme molecule to complementary 
target RNA, followed by endonucleolytic cleavage. Examples, which may 
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be used, include engineered hannnnerhead motif ribozyme molecules that 
can be specifically and efficiently catalyze endonucleolytic cleavage of 
sequences encoding the proteins of the invention and homologous 
proteins. Specific ribozyme cleavage sites within any potential RNA target 
are initially identified by scanning the target molecule for ribozyme 
cleavage sites which include the following sequences: GUA, GUU, and 
GUC. Once identified, short RNA sequences of between 15 and 20 
ribonucleotides corresponding to the region of the target gene containing 
the cleavage site may be evaluated for secondary structural features which 
may render the oligonucleotide inoperable. The suitability of candidate 
targets may also be evaluated by testing accessibility to hybridization with 
complementary oligonucleotides using ribonuclease protection assays. 

Nucleic acid effector molecules, e.g. antisense molecules and ribozymes of 
the invention may be prepared by any method known in the art for the 
synthesis of nucleic acid molecules. These include techniques for 
chemically synthesizing oligonucleotides such as solid phase 
phosphoramidite chemical synthesis. Alternatively, RNA molecules may be 
generated by in vitro and in vivo transcription of DNA sequences. Such 
DN A sequences may be incorporated into a variety of vectors with suitable 
RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA 
constructs that synthesize antisense RNA constitutively or tnducibly can be 
introduced into cell lines, cells or tissues. RNA molecules may be modified 
to increase intracellular stability and half-life. Possible modifications 
include, but are not limited to, the addition of flanking sequences at the 5' 
and/or 3' ends of the molecule or modifications in the nucleobase, sugar 
and/or phosphate moieties, e.g. the use of phosphorothioate or 2' 
O-methyl rather than phosphodiesterase linkages within the backbone of 
the molecule. This concept is inherent in the production of PNAs and can 
be extended in all of these molecules by the inclusion of non-traditional 
bases such as inosine, queosine, and wybutosine, as well as acetyl-, 
methyl-, thio-, and similarly modified forms of adenine, cytidine, guanine. 
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thymine, and uridine which are not as easily recognized by endogenous 
endonucleases. 

Many methods for introducing vectors into cells or tissues are available and 
equally suitable for use in vivo, in vitro, and ex vivo. For ex vivo therapy, 
vectors may be Introduced into stem cells taken from the patient and 
clonaify propagated for autologous transplant back into that same patient. 
Delivery by transfection and by liposome injections may be achieved using 
methods, which are well known in the art. Any of the therapeutic methods 
described above may be applied to any suitable subject including, for 
example, mammals such as dogs, cats, cows, horses, rabbits, monkeys, 
and most preferably, humans. 

An additional embodiment of the invention relates to the administration of 
a pharmaceutical composition, in conjunction with a pharmaceutically 
acceptable carrier, for any of the therapeutic effects discussed above. 
Such pharmaceutical compositions may consist of the nucleic acids and the 
proteins of the invention and homologous nucleic acids or proteins, 
antibodies to the proteins of the invention and homologous proteins, 
mimetics, agonists, antagonists or inhibitors of the proteins of the 
invention and homologous proteins or nucleic acids» The compositions may 
be administered alone or in combination with at least one other agent, such 
as stabilizing compound, which may be administered in any sterile, 
biocompatible pharmaceutical carrier, including, but not limited to, saline, 
buffered saline, dextrose, and water. The compositions may be 
administered to a patient alone or in combination with other agents, drugs 
or hormones. The pharmaceutical compositions utilized In this invention 
may be administered by any number of routes including, but not limited to, 
oral, intravenous, intramuscular, intra-arterial, intramedullary, intrathecal, 
intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal, 
enteral, topical, sublingual or rectal means. 
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In addition to the active ingredients, these pharmaceutical compositions 
may contain suitable pharmaceutically-acceptable carriers comprising 
excipients and auxiliaries, which facilitate processing of the active 
compounds into preparations, which can be used pharmaceutically. Further 
details on techniques for formulation and administration may be found in 
the latest edition of Remington's Pharmaceutical Sciences (Maack 
Publishing Co., Easton, Pa.). 

Pharmaceutical compositions suitable for use in the invention include 
compositions wherein the actwe ingredients are contained in an effective 
amount to achieve the intended purpose. The determination of an effective 
dose is well within the capability of those skilled in the art. For any 
compounds, the therapeutically effective does can be estimated initially 
either in cell culture assays, e.g., of preadipocyte cell lines or in animal 
models, usually mice, rabbits, dogs or pigs. The animal model may also be 
used to determine the appropriate concentration range and route of 
administration. Such information can then be used to determine useful 
doses and routes for administration in humans. A therapeutically effective 
dose refers to that amount of active ingredient, for example the nucleic 
acids or the proteins of the invention or fragments thereof or antibodies, 
which is sufficient for treating a specific condition. Therapeutic efficacy 
and toxicity may be determined by standard pharmaceutical procedures in 
cell cultures or experimental animals, e,g., ED50 (the dose therapeutically 
effective in 50% of the population) and LD50 {the dose lethal to 50% of 
the population). The dose ratio between therapeutic and toxic effects is the 
therapeutic index, and it can be expressed as the ratio, LD50/ED50. 
Pharmaceutical compositions, which exhibit large therapeutic indices, are 
preferred. The data obtained from ceil culture assays and animal studies is 
used in formulating a range of dosage for human use. The dosage 
contained in such compositions is preferably within a range of circulating 
concentrations that include the ED50 with little or no toxicity. The dosage 
varies within this range depending upon the dosage from employed. 
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sensitivity of the patient, and the route of administration. The exact dosage 
will be determined by the practitioner, in light of factors related to the 
subject that requires treatment. Dosage and administration are adjusted to 
provide sufficient levels of the active moiety or to maintain the desired 
effect. Factors, which may be taken into account, include the severity of 
the disease state, general health of the subject, age, weight, and gender of 
the subject, diet, time and frequency of administration, drug 
combination(s), reaction sensitivities, and tolerance/response to therapy. 
Long-acting pharmaceutical compositions may be administered every 3 to 
4 days, every week or once every two weeks depending on half-life and 
clearance rate of the particular formulation. Normal dosage amounts may 
vary from 0.1 to 100,000 micrograms, up to a total dose of about 1 g, 
depending upon the route of administration. Guidance as to particular 
dosages and methods of delivery is provided in the literature and generally 
available to practitioners in the art. Those skilled in the art employ different 
formulations for nucleotides than for proteins or their inhibitors. Similarly, 
delivery of polynucleotides or polypeptides will be specific to particular 
cells, conditions, locations, etc. 

In another embodiment, antibodies which specifically bind to the proteins 
may be used for the diagnosis of conditions or diseases characterized by or 
associated with over- or underexpression of the proteins of the invention 
and homologous proteins or in assays to monitor patients being treated 
with the proteins of the invention and homologous proteins, or effectors 
thereof, e.g. agonists, antagonists, or inhibitors. Diagnostic assays include 
methods which utilize the antibody and a label to detect the protein in 
human body fluids or extracts of cells or tissues. The antibodies may be 
used with or without modification, and may be labeled by joining them, 
either covalently or non-covalently, with a reporter molecule. A wide 
variety of reporter molecules which are known in the art may be used 
several of which are described above. 
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A variety of protocols Including ELISA, RIA, and FACS for measuring 
proteins are known in tiie art and provide a basis for diagnosing altered or 
abnormal levels of gene expression. Normal or standard values for gene 
expression are established by combining body fluids or cell extracts taken 
from normal mammalian subjects, preferably human, with antibodies to the 
protein under conditions suitable for complex formation. The amount of 
standard complex formation may be quantified by various methods, but 
preferably by photometric means. Quantities of protein expressed In control 
and disease samples e.g. from biopsied tissues are compared with the 
standard values. Deviation- between standard and subject values 
establishes the parameters for diagnosing disease. 

In another embodiment of the invention, the polynucleotides specific for 
the proteins of the invention and homologous proteins may be used for 
diagnostic purposes. The polynucleotides, which may be used, include 
oligonucleotide sequences, antisense RNA and DNA molecules, and PNAs. 
The polynucleotides may be used to detect and quantitate gene expression 
in biopsied tissues in which gene expression may be correlated with 
disease. The diagnostic assay may be used to distinguish between 
absence, presence, and excess gene expression, and to monitor regulation 
of protein levels during therapeutic intervention. 

In one aspect, hybridization with probes which are capable of detecting 
polynucleotide sequences, including genomic sequences, encoding the 
proteins of the invention and homologous proteins or closely related 
molecules, may be used to identify nucleic acid sequences which encode 
the respective protein. The hybridization probes of the subject invention 
may be DNA or RNA and are preferably derived from the nucleotide 
sequence of the polynucleotide encoding the proteins of the invention or 
from a genomic sequence including promoter, enhancer elements, and 
introns of the naturally occurring gene. Hybridization probes may be labeled 
by a variety of reporter groups, for example, radionuclides such as ^^P or 
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^^S or enzymatic labels, such as alkaline phosphatase coupled to the probe 
via avidin/biotin coupling systems, and the like. ^ 

Polynucleotide sequences specific for the proteins of the invention and 
homologous nucleic acids may be used for the diagnosis of conditions or 
diseases, which are associated with the expression of the proteins. 
Examples of such conditions or diseases include, but are not limited to, 
pancreatic diseases and disorders, including diabetes. Polynucleotide 
sequences specific for the proteins of the invention and homologous 
proteins may also be used to monitor the progress of patients receiving 
treatment for pancreatic diseases and disorders, including diabetes. The 
polynucleotide sequences may be used qualitative or quantitative assays, 
e.g. in Southern or Northern analysis, dot blot or other membrane-based 
technologies; in PGR technologies; or In dip stick, pin, ELISA or chip assays 
utilizing fluids or tissues from patient biopsies to detect altered gene 
expression. 

In a particular aspect, the nucleotide sequences specific for the proteins of 
the invention and homologous nucleic acids may be useful in assays that 
detect activation or induction of various metabolic diseases such as obesity 
as well as related disorders such as eating disorder, cachexia, diabetes 
mellitus, hypertension, coronary heart disease, hypercholesterolemia, 
dysllpidemla, osteoarthritis, gallstones, cancers of the reproductive organs, 
and sleep apnea. The nucleotide sequences may be labeled by standard 
methods, and added to a fluid or tissue sample from a patient under 
conditions suitable for the formation of hybridization complexes. After a 
suitable incubation period, the sample is washed and the signal is 
quantitated and compared with a standard value. If the amount of signal in 
the biopsied or extracted sample is significantly altered from that of a 
comparable have hybridized with nucleotide sequences in the sample, and 
the presence of altered levels of nucleotide sequences encoding the 
proteins of the invention and homologous proteins in the sample indicates 
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the presence of the associated disease. Such assays may also be used to 
evaluate the efficacy of a particular therapeutic treatment regimen in 
animal studies, in clinical trials or in monitoring the treatment of an 
individual patient. 

In order to provide a basis for the diagnosis of a disease associated with 
expression of the proteins of the invention and homologous proteins, a 
normal or standard profile for expression is established. This may be 
accomplished by combining body fluids or cell extracts taken from normal 
subjects, either animal or human, with a sequence or a fragment thereof, 
which is specific for the nucleic acids encoding the proteins of the 
invention and homologous nucleic acids, under conditions suitable for 
hybridization or amplification. Standard hybridization may be quantified by 
comparing the values obtained from normal subjects with those from an 
experiment where a known amount of a substantially purified 
polynucleotide is used. Standard values obtained from normal samples may 
be compared with values obtained from samples from patients who are 
symptomatic for disease. Deviation between standard and subject values 
is used to establish the presence of disease. Once disease Is established 
and a treatment protocol Is initiated, hybridization assays may be repeated 
on a regular basis to evaluate whether the level of expression in the patient 
begins to approximate that, which is observed in the normal patient. The 
results obtained from successive assays may be used to show the efficacy 
of treatment over a period ranging from several days to months. 

With respect to metabolic diseases such as described above the presence 
of an unusual amount of transcript in biopsied tissue from an individual 
may indicate a predisposition for the development of the disease or may 
provide a means for detecting the disease prior to the appearance of actual 
clinical symptoms. A more definitive diagnosis of this type may allow 
health professionals to employ preventative measures or aggressive 
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treatment earlier thereby preventing the development or further progression 
of the metabolic diseases and disorders. 

Additional diagnostic uses for oligonucleotides designed from the 
sequences encoding the proteins of the invention and homologous proteins 
may involve the use of PGR. Such oligomers may be chemically 
synthesized, generated enzymatically or produced from a recombinant 
source. Oligomers will preferably consist of two nucleotide sequences, one 
with sense orientation (5pr!me.fwdarw,3prime) and another with antisense 
(3prime.rarw.5prime), employed under optimized conditions for 
identification of a specific gene or condition. The same two oligomers, 
nested sets of oligomers or even a degenerate pool of oligomers may be 
employed under less stringent conditions for detection and/or quantification 
of closely related DNA or RNA sequences. 

In another embodiment of the invention, the nucleic acid sequences may 
also be used to generate hybridization probes, which are useful for 
mapping the naturally occurring genomic sequence. The sequences may be 
mapped to a particular chromosome or to a specific region of the 
chromosome using well known techniques. Such techniques include FISH, 
FACS or artificial chromosome constructions, such as yeast artificial 
chromosomes, bacterial artificial chromosomes, bacterial PI constructions 
or single chromosome cDNA libraries as reviewed in Price, C. M. (1993) 
Blood Rev. 7:127-134, and Trask, B. J. (1991) Trends Genet. 7:149-154. 
FISH (as described in Verma et al. (1 988) Human Chromosomes: A Manual 
of Basic Techniques, Pergamon Press, New York, N.Y.). The results may 
be correlated with other physical chromosome mapping techniques and 
genetic map data. Examples of genetic map data can be found in the 1 994 
Genome Issue of Science (265: 198 If). Correlation between the location of 
the gene encoding the proteins of the invention on a physical chromosomal 
map and a specific disease or predisposition to a specific disease, may help 
to delimit the region of DNA associated with that genetic disease. 
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The nucleotide sequences of the subject invention may be used to detect 
differences in gene sequences between normal, carrier or affected 
individuals. An analysis of polymorphisms, e.g. single nucleotide 
polymorphisms may be carried out. Further, in situ hybridization of 
chromosomal preparations and physical mapping techniques such as 
linkage analysis using established chromosomal markers may be used for 
extending genetic maps. Often the placement of a gene on the 
chromosome of another mammalian species, such as mouse, may reveal 
associated markers even if the number or arm of a particular human 
chromosome is not known. New sequences can be assigned to 
chromosomal arms or parts thereof, by physical mapping. This provides 
valuable information to investigators searching for disease genes using 
positional cloning or other gene discovery techniques. Once the disease or 
syndrome has been crudely localized by genetic linkage to a particular 
genomic region, for example, AT to 11q22-23 (Gattr, R. A. et al. (1988) 
Nature 336:577-580), any sequences mapping to that area may represent 
associated or regulatory genes for further Investigation. The nucleotide 
sequences of the subject invention may also be used to detect differences 
in the chromosomal location due to translocation, inversion,, etc. among 
normal, carrier or affected individuals. 

In another embodiment of the invention, the proteins of the invention, its 
catalytic or immunogenic fragments or oligopeptides thereof, an in vitro 
model, a genetically altered cell or animal, can be used for screening 
libraries of compounds in any of a variety of drug screening techniques. 
One can identify effectors, e.g. receptors, ligands or substrates that bind 
to, modulate or mimic the action of one or more of the proteins of the 
invention. The protein or fragment thereof employed in such screening may 
be free in solution, affixed to a solid support, borne on a cell surface, or 
located intracellularly. The formation of binding complexes, between the 
proteins of the invention and the agent tested, may be measured. Agents 
can also be identified, which, either directly or indirectly, influence the 
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activity of the protein of the invention. Of particular interest are screening 
assays for agents that have a low toxicity for mammalian cells. The term 
"agent" as used herein describes any molecule, e.g. protein or 
pharmaceutical, with the capability of altering or mimicking the 
physiological function of one or more of the proteins of the Invention. 
Candidate agents encompass numerous chemical classes, though typically 
they are organic molecules, preferably small organic compounds having a 
molecular weight of more than 50 and less than about 2,500 Daltons. 
Candidate agents comprise functional groups necessary for structural 
interaction with proteins, particularly hydrogen bonding, and typically 
include at least an amine, carbonyl, hydroxyl or carboxyl group, preferably 
at least two of the functional chemical groups. The candidate agents often 
comprise carbocyclic or heterocyclic structures and/or aromatic or 
polyaromatic structures substituted with one or more of the above 
functional groups. 

Candidate agents are also found among biomolecules including peptides, 
saccharides, fatty acids, steroids, purines, pyrimidines, nucleic acids and 
derivatives, structural analogs or combinations thereof- Candidate agents 
are obtained from a wide variety of sources including libraries of synthetic 
or natural compounds. For example, numerous means are avaiiabie for 
random and directed synthesis of a wide variety of organic compounds and 
biomolecules, including expression of randomized oligonucleotides and 
oligopeptides. Alternatively, libraries of natural compounds in the form of 
bacterial, fungal, plant and animal extracts are available or readily 
produced. Additionally, natural or synthetically produced libraries and 
compounds are readily modified through conventional chemical, physical 
and biochemical means, and may be used to produce combinatorial 
libraries. Known pharmacological agents may be subjected to directed or 
random chemical modifications, such as acylation, alkylation, esterif ication, 
amidification, etc. to produce structural analogs. Where the screening 
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assay is a binding assay, one or more of the molecules may be joined to a 
label, where the label can directly or indirectly provide a detectable sigrial, 

Another technique for drug screening, which may be used, provides for 
high throughput screening of compounds having suitable binding affinity to 
the protein of interest as described in published PCT application 
WO84/03564. In this method, as applied to the proteins of the invention 
large numbers of different small test compounds, e.g. aptamers, peptides, 
low-molecular weight compounds etc., are provided or synthesized on a 
solid substrate, such as plastic pins or some other surface. The test 
compounds are reacted with the proteins or fragments thereof, and 
washed. Bound proteins are then detected by methods well known in the 
art. Purified proteins can also be coated directly onto plates for use in the 
aforementioned drug screening techniques. Alternatively, non-neutralizing 
antibodies can be used to capture the peptide and immobilize it on a solid 
support. In another embodiment, one may use competitive drug screening 
assays in which neutralizing antibodies capable of binding the protein 
specifically compete with a test compound for binding the protein. In this 
manner, the antibodies can be used to detect the presence of any peptide, 
which shares one or more antigenic determinants with the protein. 

Finally, the invention also relates to a kit comprising at least one of 

(a) an adenosine kinase nucleic acid molecule or a fragment thereof; 

(b) a vector comprising the nucleic acid of (a); 

(c) a host celt comprising the nucleic acid of (a) or the vector of (b); 

(d) a polypeptide encoded by the nucleic acid of (a); 

(e) a fusion polypeptide encoded by the nucleic acid of (a); 

(f) an antibody, an aptamer or another receptor against the nucleic acid 
of (a) or the polypeptide of (d) or (e) and 

(g) an anti-sense oligonucleotide of the nucleic acid of (a). 
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The kit may be used for diagnostic or therapeutic purposes or for screening 
applications as described above. The l^it may further contain user 
instructions. 

The Figures show: 

FIGURE 1 shows the triglyceride content and the glycogen content of 
Drosophila Ady43A (GadFly Accession Number CGI 851) mutants. Shown 
is the change of triglyceride content of HD-EP(2)21 939 flies caused by 
integration of the P-vector into the into the annotated transcription unit 
(column 3) in comparison to controls containing more than 2000 fly lines 
of the proprietary EP collection CHD-control (TG)% column 1) and wildtype 
controls determined in more than 80 independent assays (referred to as 
'WT-control (TG)' column 2). Also shown is the change of glycogen 
content of HD-EP(2)21939 flies caused by integration of the P-vector into 
the into the annotated transcription unit (column 5) in comparison to 
controls (referred to as 'control (glycogen)' column 4). 

FIGURE 2 shows the molecular organization of the mutated Ady42A 
(Gadfly Accession Number CGI 851) gene locus. 

The examples illustrate the inventfon: 

Example 1 : Measurement of energy storage metabolites (ESM) content in 
Drosophila 

Mutant flies are obtained from a fly mutation stock collection. The flies are 
grown under standard conditions known to those skilled in the art. In the 
course of the experiment, additional feedings with bakers yeast 
(Saccharomyces cerevisiae) are provided for the EP-line HD-EP21939. The 
average change of triglyceride and glycogen (herein referred to as energy 
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storage metabolites, ESM) content of Drosophila containing the EP-vector 
as liomozygous viable integration was investigated in comparison to 
control flies, respectively {see FIGURE 1), For determination of ESM 
content, flies were incubated for 5 min at 90°C in an aqueous buffer using 
a waterbath, followed by hot extraction. After another 5 min incubation at 
90°C and mild centrifugation, the triglyceride content of the flies extract 
was determined using Sigma Triglyceride (INT 336-10 or -20) assay by 
measuring changes in the optical density according to the manufacturer's 
protocol, and the glycogen content of the flies extract was determined 
using Roche (Starch UV-method Cat. No. 0207748) assay by measuring 
changes in the optical density according to the manufacturer's protocol. As 
a reference the protein content of the same extract was measured using 
BIO-RAD DC Protein Assay according to the manufacturer's protocol. 
These experiments and assays were repeated several times. 

The average triglyceride level i/jg triglyceride///g protein) of 2108 fly lines 
of the proprietary EP-collection (referred to as 'HD-control (TG)') is shown 
as 100% in the first column in FIGURE 1 . The average triglyceride level (//g 
triglyceride///g protein) of wlldtype OreR flies was determined in 84 
independent assays (referred to as 'WT-control (TG)') is shown as 102% 
in the second column in FIGURE 1. The average glycogen lever [jjq 
glycogen///g protein) of an internal assay control consisting of two different 
wildtype strains and an inconspicuous EP-line of the HD stock collection 
(referred to as 'control (glycogen)') is shown as 100% in the fourth column 
in FIGURE 1 . Standard deviations of the measurements are shown as thin 
bars. HD-21939 homozygous flies show constantly a higher triglyceride 
content (//g triglyceride/A/g protein) than the controls (column 3 in FIGURE 
1, 'HD-21939 (TG)'). HD-21939 homozygous flies also show a higher 
glycogen content {jjq triglyceride///g protein) than the controls (column 5 in 
FIGURE 1, 'HD-21939 (glycogen)'). Therefore, the loss of gene activity is 
responsible for changes in the metabolism of the energy storage 
triglycerides. 
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Example 2: Identification of a Drosophila gene responsible for the change 
in triglyceride and glycogen contents 

Genomic DNA sequences were isolated that are localized directly adjacent 
to the EP vector {herein HD-EP(2)21 939) integration. Using those isolated 
genomic sequences public databases like Berkeley Drosophila Genome 
Project (GadFly) were screened thereby confirming the homozygous viable 
integration site of the HD-EP{2)21 939 vector 6656 base pairs 5prime of 
the cDNA CGI 851 -RB in sense orientation. FIGURE 2 shows the molecular 
organization of this gene locus. The chromosomal localization site of 
integration of the vector of HD-EP(2)21 939 is at gene locus 2R, 43A2 
(according to Flybase and Gadfly release 3), In FIGURE 2, genomic DNA 
sequence is represented by the assembly as a black arrow in middle of the 
figure that includes the integration site of HD-EP(2)21 939. Ticks represent 
the length in basepairs of the genomic DNA (1000 base pairs per tick). 
Dark grey bars in the lower half of the figure, linked by dark grey lines 
represent cDNAs of the predicted genes (as predicted by the Berkeley 
Drosophila Genome Project, GadFly release 3). Predicted exons of the 
Drosophila cDNA of the gene Ady43A (GadFly Accession Number 
CGI 851) are shown as dark grey bars and predicted introns as slim grey 
lines in the lower half of the figure and are labeled. Therefore, expression 
of the cDNA encoding Ady43A could be affected by integration of the 
vector of line HD-EP(2)21 939, leading to a change in the amount of energy 
storage triglycerides- 

Table 1 is summarizing the data of our molecular analysis of the Drosophila 
protein identified in this invention as being involved in the regulation of the 
metabolism- 
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Table 1 . Molecular analysis of Drosophila Ady43A 



Analysis 


Ady43A 


Genetic 
interaction: 


not described (Flybase) 


Protein 


adenosine kinase (Flvbase^ 


Protein 
domains 


Ribokinase-like (Flybase) 


InterPro 
analysis 


Carbohydrate kinase, PfkB (IPR002173) 


Drosophila 
functional data 


not described (Flybase) 


Locus 


2R. 43A2 (FIvbase): 2R 43A2(Gadfly release 3^ 


Ests 


GM02629 


CDNA 


AA695338 (525 base pairs nnRNA), AJ236864 (3457 
base pairs mRNA), AW941174 (431 base pairs mRNA) 
(Flybase) ..... 


genomic DNA 


AE003842 (252463 base pairs, DNA; protein :AAF59279; 
protein:AAM6891 1), AY1 02669 (protein:AAM27498) 
fFlvbase^ 


NCBI locus ID 


44750, Dm Ady43A, 43A3; Aliases: Adk43A, CG1851, 

AE003842, AJ236864, AW941174, AY1 02669; Protftin: 
NP_651995 (366 amino acids), AAF59279 (220 amino 
acids), AAM68911 (220 amino acids), AAM27498 (366 
amino acids) 


Drosophila 
mutations & 
mutants 


not described (Flybase) 



Example 3: Identification of human ADK homologous genes and proteins 



ADK homologous proteins and nucleic acid molecules coding therefore are 
obtainable from insect or vertebrate species, e»g. mammals or birds. 
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Particularly preferred are nucleic acids comprising Drosophila Ady43A, or 
human ADK homologs (in particular the human adenosine kinase, isoform 
a, and the human adenosine kinase, isoform b). 

Sequences homologous to Drosophila Ady43A were identified using the 
publicly available program BLASTP 2,2.3 of the non-redundant protein data 
base of the National Center for Biotechnology Information (NCBI) {see, 
Altschul et a!., 1997, Nucleic Acids Res. 25:3389-3402). Table 2 shows 
the best human homologs of the Drosophila Ady43A. 

Table 2- Human homologous proteins to Drosophila Ady43A protein 

* NCBI (National Center for Biotechnology Information) human locus 
identification (ID): 132, Hs ADK, adenosine kinase; 10cen-q24 

* OMIM: 102750 

* RefSeq[R]: GenBank Accession Numbers NM_001 1 23 (45% homology of 
amino acids 35-366 of the Drosophila Ady43A protein (GenBank Accession 
Number NP_651995) to amino acids 13-339 of the human adenosine 
kinase, isoform a (345 amino acids in total)) and NM_006721 (45% 
homology of amino acids 35-366 of the Drosophila Ady43A protein 
(GenBank Accession Number NP_651995) to amino acids 30-356 of the 
human adenosine kinase, isoform b (362 amino acids in total)) 

* Nucleotide: GenBank Accession Numbers BC003568, U33936, U50196, 
U90338, U90339 

* Protein: GenBank Accession Numbers NP_001114 (345 amino acids), 
NP_00671 2 (362 amino acids), AAH03568 (345 amino acids), AAB01 689 
(334 amino acids), AAA97893 (345 amino acids), AAB50234 (362 amino 
acids), AAB50235 (345 amino acids) 
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The mouse homologous cDNAs encoding the polypeptides of the invention 
were identified as GenBank Accession Numbers NM_1 34079 and 
NM 007411. 
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Claims 

A pharmaceutical composition comprising a nucleic acid molecule of 
tlie adenosine kinase gene family or a polypeptide encoded thereby 
or a fragment or a variant of said nucleic acid molecule or said 
polypeptide or an effector of said nucleic acid molecule or 
polypeptide together with pharmaceutically acceptable carriers, 
diluents and/or adjuvants. 

The composition of claim 1 , wherein the nucleic acid molecule is a 
vertebrate or insect adenosine kinase nucleic acid, particulary 
encoding the human adenosine kinase homologs, and/or a nucleic 
molecule which is complementary thereto or a fragment thereof or a 
variant thereof. 

The composition of claim 1 or 2, wherein said nucleic acid molecule 

(a) hybridizes at 50^C in a solution containing 1 x SSC and 0.1 % 
SDS to a nucleic acid molecule as defined in claim 2 and/or a 
nucleic acid molecule which is complementary thereto; 

(b) it is degenerate with respect to the nucleic acid molecule of 
(a), 

(c) encodes a polypeptide which is at least 85%, preferably at 
least 90%, more preferably at least 95%, more preferably at 
least 98% and up to 99,6% identical to a human adenosine 
kinase , as defined in claim 2; 

(d) differs from the nucleic acid molecule of (a) to (c) by mutation 
and wherein said mutation causes an alteration, deletion, 
duplication or premature stop in the encoded polypeptide. 

The composition of any one of claims 1 -3, wherein the nucleic acid 
molecule is a DNA molecule, particularly a cDNA or a genomic DNA. 
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The composition of any one of claims 1 -4, wherein said nucleic acid 
encodes a polypeptide contributing to regulating the energy 
homeostasis and/or the metabolism of triglycerides. 

The composition of any one of claims 1-5, wherein said nucleic acid 
molecule is a recombinant nucleic acid molecule. 

The composition of any one of claims 1 -6, wherein Itie nucleic acid 
molecule is a vector, particularly an expression vector. 

The composition of any one of claims 1-5, wherein the polypeptide 
is a recombinant polypeptide. 

The composition of claim 8, wherein said recombinant polypeptide is 
a fusion polypeptide. 

The composition of any one of claims 1 -7, wherein said nucleic acid 
molecule is selected from hybridization probes, primers and 
anti-sense oligonucleotides. 

The composition of any one of claims 1 -1 0 which is a diagnostic 
composition. 

The composition of any one of claims 1-10 which is a therapeutic 
composition. 

The composition of any one of claims 1-12 for the manufacture of 
an agent for detecting and/or verifying, for the treatment, alleviation 
and/or prevention of an disorders, including metabolic diseases such 
as obesity and other body-weight regulation disorders as well as 
related disorders such as eating disorder, cachexia, diabetes 
mellitus, hypertension, coronary heart disease, hyper- 
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cholesterolemia, dyslipidemia, osteoarthritis, gallstones, cancer, e.g. 
cancers of the reproductive organs, and sleep apnea and others, in 
cells, cell masses, organs and/or subjects. 

5 14. Use of a nucleic acid molecule of the adenosine kinase gene family 
or a polypeptide encoded thereby or a fragment or a variant of said 
nucleic acid molecule or said polypeptide or an effector of said 
nucleic or polypeptide for controlling the function of a gene and/or a 
gene product which is influenced and/or modified by an adenosine 
10 kinase homologous polypeptide. 

1 5. Use of the nucleic acid molecule of the adenosine kinase gene family 
or a polypeptide encoded thereby or a fragment or a variant of said 
nucleic acid molecule or said polypeptide or an effector of said 

15 nucleic acid molecule or polypeptide for identifying substances 

capable of interacting with an adenosine kinase homologous 
polypeptide- 

16. A non-human transgenic animal exhibiting a modified expression of 
20 an adenosine kinase homologous polypeptide. 

17. The animal of claim 16, wherein the expression of the adenosine 
kinase homologous polypeptide is increased and/or reduced. 

25 18. A recombinant host cell exhibiting a modified expression of an 
adenosine kinase homologous polypeptide. 

19. The cell of claim 18 which is a human cell. 

30 20. A method of identifying a (poly)peptide involved in the regulation of 
energy homeostasis and/or metabolism of triglycerides in a mammal 
comprising the steps of 
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(a) contacting a collection of {poly)peptides with an adenosine 
^ kinase homologous polypeptide or a fragment thereof under 

conditions that allow binding of said (poly)peptides; 

(b) removing (poly) peptides which do not bind and 

(c) identifying (poly) peptides that bind to said adenosine kinase 
homologous polypeptide. 

A method of screening for an agent which modulates the interaction 
of an adenosine kinase homologous polypeptide with a binding 
target/agent, comprisirrg the steps of 

(a) incubating a mixture comprising 

(aa) an adenosine kinase homologous polypeptide or a 
fragment thereof; 

(ab) a binding target/agent of said adenosine kinase 
homologous polypeptide or fragment thereof; and 

(ac) a candidate agent 

under conditions whereby said adenosine kinase polypeptide 
or fragment thereof specifically binds to said binding 
target/agent at a reference affinity; 

(b) detecting the binding affinity of said adenosine kinase 
polypeptide or fragment thereof to said binding target to 
determine a (candidate) agent-biased affinity; and 

(c) determining a difference between (candidate) agent-biased 
affinity and reference affinity. 

A method of screening for an agent which modulates the activity of 
an adenosine kinase homologous polypeptide comprising the steps 
of 

(a) incubating a mixture comprising 

(aa) an adenosine kinase homologous polypeptide or a 
fragment thereof and 
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(ab) a candidate agent 

under conditions whereby said adenosine kinase polypeptide 
or fragment thereof exhibits a reference activity, 

(b) detecting the activity of said adenosine kinase polypeptide or 
fragment thereof to determine a {candidate) agent-biased 
activity and 

(c) determining a difference between (candidate) agent-biased 
activity and reference activity. 

23. A method of producing a composition comprising the {poly)peptide 
Identified by the method of claim 20 or the agent identified by the 
method of claim 21 or 22 with a pharmaceutlcally acceptable carrier, 
diluent and/or adjuvant. 

24. The method of claim 23 wherein said composition is a 
pharmaceutical composrtion for preventing, alleviating or treating of 
diseases and disorders, including metabolic diseases such as obesity 
and other body-weight regulation disorders as well as related 
disorders such as eating disorder, cachexia, diabetes mellitus, 
hypertension, coronary heart disease, hypercholesterolemia, 
dyslipidemia, osteoarthritis, gallstones, cancer, e.g. cancers of the 
reproductive organs, and sleep apnea and other diseases and 
disorders. 

25. Use of a (poly)peptlde as identified by the method of claim 20 or of 
an agent as Identified by the method of claim 21 or 22 for the 
preparation of a pharmaceutical composition for the treatment, 
alleviation and/or prevention of of diseases and disorders. Including 
metabolic diseases such as obesity and other body-weight regulation 
disorders as well as related disorders such as eating disorder, 
cachexia, diabetes mellitus, hypertension, coronary heart disease, 
hypercholesterolemia, dyslipidemia, osteoarthritis, gallstones, 
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cancer, e.g. cancers of the reproductive organs, and sleep apnea 
and other diseases and disorders. 

Use of a nucleic acid molecule of the adenosine kinase gene family 
or of a fragment thereof for the preparation of a non-human animal 
which over- or under-expresses the adenosine kinase gene product. 

Kit comprising at least one of 

(a) a adenosine kinase nucleic acid molecule or a fragment 
thereof; 

(b) a vector comprising the nucleic acid of (a); 

(c) a host cell comprising the nucleic acid of (a) or the vector of 
(b); 

(d) a polypeptide encoded by the nucleic acid of (a); 

(e) a fusion polypeptide encoded by the nucleic acid of (a); 

(f) an antibody, an aptamer or another receptor against the 
nucleic acid of (a) or the polypeptide of (d) or (e) and 

(g) an anti-sense oligonucleotide of the nucleic acid of (a). 
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Abstract 

The present invention discloses adenosine kinase homologous proteins 
regulating the energy homeostasis and the metabolism of triglycerides, and 
polynucleotides, which identify and encode the proteins disclosed in this 
invention- The invention also relates to the use of these sequences in the 
diagnosis, study, prevention, and treatment of diseases and disorders, for 
example, but not limited to, metabolic disorders and diseases such as the 
metabolic syndrome, including obesity, eating disorder, cachexia, diabetes 
meltitus, hypertension, coronary heart disease, hypercholesterolemia, 
dyslipidemia, osteoarthritis, gallstones, cancers of the reproductive organs, 
and sleep apnea. 
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